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The starting point of the present paper is BeU's notion of local causality and his own sharpening 
of it so as to provide for mathematical formalisation. Starting with Norsen's (2007 [I3], 2009 [ly|) 
analysis of this formalisation, it is subjected to a critique that reveals two crucial aspects that have 
so far not been properly taken into account. These are (i) the correct understanding of the notions of 
sufficiency, completeness and redundancy involved; and (ii) the fact that the apparatus settings and 
measurement outcomes have very different theoretical roles in the candidate theories under study. 
Both aspects are not adequately incorporated in the standard formalisation, and we will therefore 
do so. The upshot of our analysis is a more detailed, sharp and clean mathematical expression of 
the condition of local causality. A preliminary analysis of the repercussions of our proposal shows 
that it is able to locate exactly where and how the notions of locality and causality are involved in 
formalising Bell's condition of local causality. 
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Now it is precisely in cleaning up intuitive 
ideas for mathematics that one is likely to 
throw out the baby with the bathwater. 

J.S. Bell (1990) d p. 106] 



I. INTRODUCTION 

Despite the existence of many mathematically precise re- 
sults concerning Bell's theorem, there continues to be 
controversy over just what the ingredients of the theorem 
are, and what the theorem together with the experiments 
confirming a violation of a Bell-inequality rules out. This 
is especially so with regard the locality and causality re- 
quirements involved. Although recently great progress 
has been achieved in clarifying Bell's theorem and the 
crucial notions and assumptions involveqj, we still be- 
lieve there to be a crucial gap left open. This paper, 
which is still work in progress, tries to fill in this gap. 

In section |ll] we will present the intuitive notion of 
local causality and the way Bell himself further sharp- 
ened it so as to allow for mathematical formalisation. In 
section IIIII Bell's formalisation of local causality will be 
commented on using the illuminating papers by Norsen 
(2007, [H, 2009 |l6l). We wiU closely follow Norsen's 



analysiqj but in doing so we will unearth two novel and 
crucial aspects that have not yet been adequately incor- 
porated in the standard formalisation. Firstly, section 
nil Al indicates the intricate relationship of the notions of 
sufficiency, completeness and redundancy involved. Sec- 
ondly, the very different theoretical role of settings and 
outcomes in the candidate theories under study is argued 
for in section IIIIBI The latter can be rather easily per- 
formed but the first needs a rather extensive discussion. 
This is performed in section lTVl and which gives a mathe- 
matical account of the two different notions of sufficiency 
that are in play. It will use an important source of in- 
spiration that has been overlooked in the debate so far. 
Namely, it will be argued that the concept of sufficiency, 
first formulated by R.A. Fisher (1922 [13) in the context 
of mathematical statistics throws a relevant light on this 
debate. 

Next, the tools obtained in the course of our analy- 
sis are used in section |V] to finally give a mathemati- 
cally sharp and clean formulation of Bell's notion of local 
causality. In section IVll the novel mathematical formali- 
sation is analysed and it is precisely indicated where and 
how the notions of locality and causality are involved. 



Section IVIII marks the end of this paper by indicating 
that, indeed, the present paper is still work in progress, 
as the repercussions of our mathematical formalisation 
of Bell's notion of local causality need still to be fully 
charted and confronted to those of other similar analy- 
siS. 
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^ For example, see Cavalcanti (2008, 
[13), Seevinck (2008 [13). 



[nil), Norsen (2007 [Q, 2009 



^ Parts of the present paper merely rehearse aspects of Norsen's 
analysis. See foonote |9] for the reason of doing so. 

^ This we hope to do in the near future (Seevink & Uffink, 2010, 
[II). 



II. THE INTUITIVE IDEA: BELL'S LOCAL 
CAUSALITY 

In the section entitled 'Principle of local causality' of the 
very last article Bell wrote on the foundations of quan- 
tum theory (published in 1990 and entitled 'La Nouvelle 
Cuisine' Q), Bell begins his explanation of the principle 
of local causality as followsQ 

"The direct causes (and effects) of events are 
near by, and even the indirect causes (and ef- 
fects) are no further away than permitted by 
the velocity of light." Bell (1990) [9, p. 105] 




"A theory is said to be locally causal if the 
probabilities attached to values of local be- 
ables in a space-time region 1 are unaltered by 
a specification of values of local beables in a 
space-like separated region 2 when what hap- 
pens in the backward light cone is already suf- 
ficiently specified, for example by a full speci- 
fication of local beables in a spacetime region 
3. It is important that region 3 completely 
shields off from 1 the overlap of the backward 
light cones of 1 and 2. And it is important 
th at events 3 be specified completely. Other- 
wise the traces in region 2 of causes of events 
in 1 could well supplement whatever else was 
being used for calculating probabilities about 
1. The hypothesis is that any such informa- 
tion about 2 becomes redundant when 3 is 
specified completely." Beh (1990) d, p. 106] 

Although this formulation is considerably sharper, it 
is not yet cleanly formulated in terms of mathematics. 
Probably for this reason Bell introduces some further no- 
tation and terminology in a subsequent discussion. He in 
effect introduces the space-time diagram of Fig. [3] that 
is adapteqj from Norsen's (2009) [la] highly illuminating 
paper. 



FIG. 1. "Space-time location of causes and effects of events in 
region 1." Figure (slightly modified) and caption taken from 
Bell (1990 d, p. 105]). 

This locates the causes operating in a certain region in 
space-time in the backward light cone of that region and 
effects of anything occuring in that region in its forward 
light cone. See Fig.[TJ But Bell remarks, the "[t]he above 
principle is not yet sufficiently sharp and clean for math- 
ematics". He then continues (see Fig. [2]): 




FIG. 3. Space-time diagram of the setup Bell considers. For 
explanation, see text. Figure adapted from Norsen (2009) 

m 




FIG. 2. "Full specification of what happens in 3 makes events 
in 2 irrelevant for predictions about 1 in a locally causal the- 
ory." Figure and caption taken from Bell (1990) 9,, p. 105]. 



* Here we will mainly focus on Bell's formulation of this principle 
as presented in 'La Nouvelle Cuisine', Bell (1990) 9|. This pre- 
sentation we take to be the most definite and precise one Bell 
ever presented; it is overall consistent with earlier formulations 
Bell used to indicate this principle. See Norsen (2007) 15] for 
further elaboration and support of this claim. 



This diagram encodes the setup Bell considers. It in- 
volves measurement on a bi-partite system (e.g., two par- 
ticles emitted by a source) where each part is measured 
by a different party, called Alice and Bob respectively. 
The outcomes of measurement are represented by be- 
ables A (in region 1) and B (in region 2) and the settings 
chosen by experimenters Alice and Bob are denoted by 
beables a and b respectively. The symbol A indicates the 
specification of the state of the bipartite system under 
study together with other relevant beables in the space- 
time regions 3a and 3b. 

The logic is now as follows. Consider a candidate the- 
ory that attempts to describe any correlations found be- 
tween outcomes A and B. Suppose region 3a shields off 
region 1 from the overlap of the past light cones of 1 and 



See footnote [ 



2, and, likewise, that region 3b shields off region 2 from 
the overlap of the past light cones of 1 and 2 (see Fig. [3]). 
It is assumed that (in this candidate theory under study) 
A constitutes a complete specificatiorO of the beables in 
region 3a and 3b. 

With all this implicitly in place, Bell continues and 
applies his principle of local causality to this setup: 

"Invoking local causality, and the assumed 
completeness of ... A ... we declare redun- 
dant certain of the conditional variables in 
the last expression because they are at space- 
like separation from the result in question." 
Bell (1990) la p. 109] 

Thus the specification of A makes both B and b redundant 
for prediction about A, and both A and a redundant for 
prediction about B. 

This finally allows for a clean formulation in mathe- 
matics of the principle. For now we follow Norsen (2007) 
|15| in claiming that this indeed gives (but, see our cri- 
tique later on) 



PiA\a,b,B,X) =P{A\a,X), 
P{B\a,b,A,X) =P{B\b,X), 



(1) 



i.e., the conditional probability of obtaining A is inde- 
pendent of both B and b given the specification A and 
a, and analogous for the probability of obtaining B. Us- 
ing the definition of conditional probability one trivially 
obtains the condition 



P{A, B\a, b, A) = P{A\a, A) P{B\b, A) 



(2) 



i.e., the joint probability for obtaining outcomes A and 
B factorizes into a product of individual probabilities 
for the two spatially separated systems, with each fac- 
tor containing conditionalization only on local beables. 
This well-known factorisation condition is thus derived 
from the principle of local causality, just as Bell himself 
stressecQ. 

In the following we will subject this reasoning and 
mathematical formalisation to a critique. We believe that 
Bell's qualitative statement in the long quote above can 
benefit greatly from a more refined and detailed mathe- 
matical discussion than available in the literature, espe- 
cially concerning (i) the understanding of the notions of 



S Norsen (2007 [H, 2009 [3) requires that such a complete spec- 
ification of region 3a and 3b not only includes A but also the 
setting a in region 3a and setting b in region 3b, respectively. 
For this purpose he has appropriately located the spacetime lo- 
cation of settings to overlap with region 3 in Fig. [3] We believe, 
however, that this need not be done, and in fact should not be 
done. In remark (two) on Dage ll2l this is further explained. 

^ "Very often such factorizability is taken as the starting point 
of the analysis. Here we have preferred to see it not as the 
formulation of "local causality" , but as a consequence thereof." 
Bell (1990) H p. 109] 



sufficiency, completeness and redundancy involved; and 
(ii) the fact that the settings a, b and outcomes A, B have 
very different theoretical roles in the candidate theories 
under study, something which, we will argue, is not ade- 
quately refiected in the reasoning leading up to ^. The 
upshot of our critique is a more detailed, sharp and clean 
mathematical expression of local causality. It will fur- 
thermore be showrlj that this pays off when interpreting 
this condition. 



III. CLEANING UP THE INTUITIVE IDEA 

Let us first comment on some crucial aspects of Bell's 
formulation of local causality, most of which can already 
be found in the literaturqj. Here we will be deliberately 
short because our main point lies elsewhere as will soon 
become clear. When possible we will quote Bell so as to 
let the 'master speak for himself. For a more detailed 
discusion of these points, see more of Bell himself (Bell, 
1976 [1], 1977ji, 1981 fsl, 1990 [9^), and especially 
Norsen (2007 [ij 2009 [16]). 

(i) It is important to note that the condition of 
local causality is only intended to be a constraint on 
candidate theories^ and not on the real world. Indeed, 
Bell starts by writing: "A theory is said to be locally 
causal if ... " [emphasis added] . Furthermore, as Norsen 
(2009, [Tg]) has pointed out. Bell has emphasized this 
point very clearly in (Bell, 1977 d p. 101]): 

"I would insist here on the distinction be- 
tween analyzing various physical theories, on 
the one hand, and philosophising about the 
unique real world on the other hand. In this 
matter of causality it is a great inconvenience 
that the real world is given to us once only. 
We cannot know what would have happened 
if something had been different. We cannot 
repeat an experiment changing just one vari- 
able; the hands of the clock will have moved, 
and the moons of Jupiter. Physical theories 
are more amenable in this respect. We can 
calculate the consequences of changing free 
elements in a theory, be they only initial con- 
ditions, and so can explore the causal struc- 
ture of the theory. I insist that [local causal- 
ity] is primarily an analysis of certain kinds 
of physical theory." 

Note that the fundamental concepts involved such as 
'beables', 'completeness', and 'free variables' are all 



* Here only a preliminary investigation of this is worked out. A full 
analysis of the repercussions of our proposal is to be presented 
in future work, see Seevinck & Uffink (2010, 18]). 

^ Nevertheless, most of these aspects are not well-known nor gen- 
erally appreciated by commentators. It is thus worthwhile men- 
tioning them here. 



relative to some particular candidate theory, 
be become clear next. 



This will 



(ii) Bell uses the term 'beable' to denote what- 
ever is posited by the candidate theory to correspond to 
something physically real: 

"The beables of the theory are those elements 
which might correspond to elements of reality, 
to things which exist. Their existence does 
not depend on observation. Indeed observa- 
tion and observers must be made out of be- 
ables. I use the term 'beable' rather than 
some more committed term like 'being' or 
'beer' to recall the essentially tentative na- 
ture of any physical theory. Such a theory 
is at best a candidate for the description of 
nature. Terms like 'being', 'beer', 'existent', 
etc., would seem to me lacking in humility. In 
fact 'beable' is short for maybe-able'." (Bell, 
1984 @, p. 174]) 

The concept 'beable' is thus theory-relative, and it 
is important that the candidate theory in question is 
absolutely clear about what it posits as physically real 
(Norsen, 2007 [13 )• Indeed, Bell emphasizes '...you 
must identify in your theory 'local ^cables'. The beahles 
of the theory are those entities in it which are, at least 
tentatively, to be taken seriously, as corresponding 
to something real." (Beh, 1990, [g p. 100 ]). And, 
" [i]t is in terms of local beables that we can hope to 
formulate some notion of local causality." (Bell, 1976, 
[3, p. 53]). When applied to our particular setup of 
Fig. [3] this implies that the candidate theory in question 
must provide a well-specified account of the beables A 
in region 3. It should be noted that A is not resticted 
in any way, as it can be anything the theory posits 
as physically real. In particular A need not be some 
classical hidden variable. 

(iii) It is important that in Fig. [2] "[r]egion 3 
completely shields off from 1 the overlap of the backward 
light cones of 1 and 2." (Bell, 1990 [I, p. 106]). Likewise, 
in the paradigmatic setup of Fig. [3] it is necessary that 
region 3a shields off region 1 from the overlap of the 
backward light cones of 1 and 2, and, analogously, 
region 3b shields off region 2 from the overlap of the 
backward light cones of 1 and 2 (see Fig. [3]). Why? For 
if this was not the case, such as for region 3' in Fig. HI 
then a violation of ([Ij fails to indicate the presence of 
some sort of non-local causation. In short, consider for 
example an event X3 in Fig. 0] located in the overlap 
of the backward light cones of regions 1 and 2 but in 
the forward light cone of region 3'. Since X3 lies in the 
overlap of the backward light cones of regions 1 and 2, 
it can influence both A and B. Now suppose X3 is a 
genuinely stochastic event, not predictable on the basis 
of the beables in region 3', then specification of events in 
region 2 could tell about X3, which in turn, could allow 



one to infer more about the events in 1 than is possible 
from just the original specification of 3'. 

The condition (HJ with region 3 replaced by region 3' 
-to be called called ((!])- would exclude any correlation 
between events b, B and the outcome A, given a and A. 
But a failure of this condition could be perfectly com- 
patible with local causality. Thus although ([TI) "may 
validly be described as a "no correlations" condition for 
regions 1 and 2, it definitely fails as a "no-causality" 
condition." (Norsen, 2007, [ll p. 12]). 




FIG. 4. Alternative space-time diagram of the Bell-type 
setup. For explanation, see text, and compare to Fig. [S] 
Figure adapted from Norsen (2009) [l^ . 



(iv) Bell takes it to be a trivial point that be- 
sides A, both the settings a, h and the outcomes A^ B are 
also beableso: "The beables must include the settings of 
switches and knobs on experimental equipment [settings] 
... and the readings of intruments [outcomes]." (Bell, 
1975, 0, p. 52]. Indeed, the settings correspond to the 
controllable part of some measurement apparatus and 
the outcomes can be taken to be manifested by the final 
position of some pointer (or something similar, such as 
a black spot on a photograph, etc.) and these are all 
beables as they must, in any serious candidate theory, 
correspond to "something physically real" (Bell, 1990 [3, 
p. 100]). 

Thus, "[the] "setting" ultimately comes down to the 
spatial configuration of some physically real matter, [and] 
. . . the outcome too is just a convenient way of referring 
to some physically real and directly observable configu- 
ration of matter, and so [these too] will necessarily be 
reflected in the beables posited by any serious candidate 
theory" (Norsen, 2009 [H p. 5]). 

There is, however, also a very important difference be- 
tween settings and outcomes that breaks the symmetry 
described above. This is a consequence of the fact that, in 
contradistinction to the outcomes, the settings are sup- 
posecO to be uncorrelated to the beables A. The rea- 



^^ A point also made by Hans Westman, private communication. 
^^ This requirement is crucial in deriving the so-called Bell- type 
inequalities that are used to proof Bell's theorem (Bell, 1964 



son being that "Now we make an important hypothesis: 
The variables a and b can be considered to be frefn. or 
random." (Bell, 1990, 0, p. 109]). Bell continues: "we 
can imagine these settings being freely chosen at the last 
second by two different experimental physicists, or some 
other random devices. If these last second choices are 
trucly random, they are not influenced by the variables 
A. Then the resultant values for a and b do not give any 
information about A. So the probability distribution over 
A does not depend on a or &", i.e.. 



p{X\a,b) ^ p{X). 



(3) 



with p some density. 

This 'free variables' assumption has the important 
repercussion that, despite the fact that from a physical 
point of view outcomes and settings are nothing but 
beables, they do have a completely different theoretical 
role to play in the candidate theories in question. In the 
literature this crucial difference has not been correctly 
incorporated in the mathematical formalisation of Bell's 
notion of local causality. One of the main points of the 
present paper is that this will be performed here (in 
section IIII Bl and beyond) . 

(v) Consider the settings a' and b' that are lo- 
cated in region 4 of Fig. |31 Since region 3' (just like 
region 3 of Fig. [3J by the way) shields off both region 
1 and 2 from the settings a' and b' that are located in 
region 4, we expect that if indeed the beables A' in region 
3' are completely specified that the beables of region 4 
would be redundant for prediction of events in region 1 
or 2. The reason being that local causality is supposed 
to enforce that a complete specification of A' renders the 
events in the backward light cone of region 3' redundant 
for predictions of events in the future light cone of 3'. 
Indeed, this is precisely what Bell's intuitive formulation 
claims (see the very first quote by Bell in section |lll 
and also Fig. [T|). Therefore, under this understanding of 
local causality, and where we take A' to be completely 
specified, we expect that ([T]) reduces to 



P{A\a',b',X')^P{A\X), 
P{B\a',b',\') = P{B\\'). 



(4) 



However, this ignores the fact that the settings a' and 
6' are by assumption independent of the beables A' of re- 
gion 3' (see the remark (iv) above). Therefore, if one 
takes this into account, we expect the most complete 
specification of A' not to shield off from region 1 and 
2 the causal influences a' and b' have on these regions. 

Thus, we in fact expect (JH) to be violated, i.e., both 
a' and h' could very well (in a locally causal way) sup- 
ply information about both A and B that is not already 



contained in A'. Yet, since a', b' lie in the backward light 
cone of both region 1 and 2 this would in no way indicate 
non-local causation. 

In order to exclude such spurious violations of ([T|) it 
must be the case that the settings should be spacelike 
separated and outside the overlap of the backward light 
cones of regions 1 and 2 (i.e., outside region 4 in Fig. U), 
and in most discussions this is indeed the case, e.g., in 
Fig. El 

Note that the main point here is not to argue for the 
settings to be located as in Fig. [3] rather than as in Fig. 
|3]-for that is a rather obviouJ^ if one wants to test local 
causality-, but that this is necessary to exclude spurious 
violations of ([IJ. 

(vi) The previous remark (i.e., (v)) serves as a 
starter for the following subsequent discussion. For 
that remark shows not only that settings should have 
a very particular space-time configuration in a sound 
formulation of local causality, it also indicates that the 
supposition that A' could be a complete specification of 
the beables in region 3' of Fig. 0] is an illusion] and the 
same holds for A in region 3 of Fig. |31 This is explained 
next. 

Indeed, we can not but give up the idea that A is a com- 
plete specification of beables in region 3, for otherwise 
"one wonders how a and b could possibly not be causally 
influenced by A (in a locally causal theory)." (Norsen, 
2009 [ia, p. 283]). In other words, A cannot be ex- 
pected to be a complete specification of region 3 because 
one must allow for the possibility of traces in region 3 of 
the causal past of both the settings, and given the inde- 
pendence of A and the settings, these traces cannot be 
included in X^ . 

An alternative understanding of this point is that one 
is here faced with a dilemma. That is, the following two 
assumptions cannot both hold: (i) the free variables as- 
sumption, and (ii) the assumption that A is completely 
specified, i.e., contains the description of all and every 
beable in region 3. However, this dilemma can in fact 
be avoided. Because, luckily, there is no need for a com- 



i). 

^^ Bell also calls them "free variables — say at the whim of experi- 
menters". (Bell, 1976 H p. 61]) 



^^ The settings a, 6 need not lie in the forward light cone of region 
3, all that is necessary is that they lie outside the overlap of the 
backward light cones of region 1 and 2. 

^* "For example, some candidate theory (and this is actually true 
of every serious extant candidate theory) might provide a spec- 
ification of the state of the particle pair which is sufficient in 
the relevant sense, even though it leaves out some fact (say, the 
millionth digit of the energy of some relic microwave background 
photon that happens to fly into the detection region just prior 
to the measurement) which actually exists in the relevant space- 
time region. Such a fact could then be allowed to determine 
the setting a without introducing even the slightest evidence for 
the problematic sort of correlation between a and A. Indeed, 
this is just an exaggerated version of what happens in the actual 
experiments, where carefully-isolated and independent pseudo- 
random-number generators are used to produce the settings at 
the two stations." (Norsen, 2009, [H, p. 283]) 



pleteness requirement when formalising the notion of lo- 
cal causality. It is only needed that the specification is 
sufficiently specified, in the relevant sense. This will be 
futher worked out in the next subsection. 

Although Bell seemed to be aware of the crucial dis- 
tinction between a sufficient and complete specification of 
the beables involved (see next subsection), he neverthe- 
less repeatedljO stressed the need for A to be a complete 
specification of the beables in region 3. The reason for 
this is the following. 

Suppose the events in region 3 are not completely spec- 
ified, i.e., A leaves out some relevant beables. Then a vi- 
olation of dD) can no longer be used to infer some sort of 
non-local causation, for it could well be that the beables 
excluded from A influence both regions I and 2 in a locally 
causal way such that correlations between 1 and 2 can no 
longer be explained by an appeal to A alone. In order to 
exclude such spurious violations of ((IJ, Bell requires the 
specification of region 3 by A to be complete. Norsen 
(2007, [l^ p. 8]) emphasizes that the key assumption by 
Bell is that events be specified completely^. "It is only 
because A is assumed to be a complete description, that 
the non-dependence of the probability of A on the distant 
outcome B follows from local causality." 

One final remark concerning the completeness (or 
sufficiency; see next subsection) that is at stake. This 
notion is not to be understood in the sense of the 
true maximal amount of knowledge concerning the 
systems under study. To the contrary, it is relative 
with respect to possible candidate theories. Therefore, 
the question whether the candidate theory in question 
is complete in the sense of including 'all that really 
exists' is not at stake (there is no need for 'omniscience', 
see Norsen (2007, |15!, p. 8]). It thus is the candidate 
theory in question that should indicate when the desired 
completeness (or sufficiency) is achieved. 



^^ Most notably this is the case in the formulation of local causality 
as given in sectionllTlabove: "And it is important that events 3 be 
specified completely." (Bell 1990 [91, p. 106]), and "Invoking local 
causality and the assumed completeness of c and A, ..." (Ibid, 
p. 109). But also already in 1975 in 'The theory of local be- 
ables' Bell requires this form of completeness: "However, in the 
particular case that [A] contains already a complete specification 
of beables in the overlap of the two light cones, supplementary 
information from region 2 could reasonable be expected to be 
redundant." (Bell, 1975, [1 p. 54]). And in 'EPR correlations 
and EPW distributions'. Bell writes: "In a locally-causal theory, 
probabilities attached to values of local beables in one space-time 
region, when values are specified for all local beables in a second 
space-time region fully obstructing the backward light cone of 
the first, are unaltered by specification of values of local beables 
in a third region with spacelike separation from the first two." 
(Bell, 1986, 0, p. 200]). 

But in a different paper Norsen (2009, |l6ll ) mentions that, 
strictly speaking, such a form of completeness is not at all needed. 
See next subsection where this is discussed. 



The remarks (iv) and (vi) indicate that crucial ele- 
ments of Bell's condition of local causality have escaped 
careful mathematical formalisation. Most notably, the 
theoretical distinctions between settings and outcomes 
and between the notions of sufficiency and completeness 
have as of yet not been properly implemented. 

In the next two subsections these two issues are to be 
further worked out so as to provide the means for a clean 
and sharp mathematical formalisation of the notions at 
stake; something which is to be undertaken in section 
rvl This fills in the gap that was left untouched in the 
analysis of others^notably the careful analysis by Norsen 
(2007 ii^, 2009 tl6i). 



A. On sufficiency and completeness 

The above remarks (v) and (vi) indicate that A cannot 
be expected to be a complete specification of region 3 
because one must allow for the possibility of traces in 
region 3 of the causal past of the settings, and given the 
independence of A and the settings, these traces cannot 
be included in A (see footnote fT4)) . 

Bell seems to account for this by indicating, in his for- 
mulation of local causality as given in section [III that 
the specification must be "sufficiently specified, for ex- 
ample by a full specification of local beables" [emphasis 
added]. Indeed, a complete specification is not necessary, 
but merely sufficient to interpret (1) as a condition of lo- 
cal causality. However, and this is the main point, we 
have just seen that such a "complete specification" is not 
an option. It is the "sufficient specification" that can at 
most be required. We must thus focus our mathematical 
formalisation on this notion of sufficiency. 

Although both BclO (1990, 3) and Norsen (2009, 
[16] ) indicate that completeness is not necessary, but only 
sufficient, in the main discussion of the notion of local 
causality they nevertheless gloss over this and assume 
that A provides a complete specification of the beables 
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^^ Apart from the citation on page [2] above ("...is already suffi- 
ciently specified, for example ..."), taken from La Nouvelle Cui- 
sine (Bell, 1990, [3]), at at least two other occassions Bell men- 
tions that the notion of sufficiency is needed when formalising the 
notion of local causality: "Consider, then, the hypothesis that 
A and B fluctuate independently when the relevant causal fac- 
tors, at time T — S — e say, whatever they may be, are sufficiently 
well specified . . . ." [...]. That is assume there are variables A and 
some probability distribution 5 such that (2) holds, [where (2) is: 
p(A,B\a,b) = J f d\d^ia{X,iJ,)pi(A\a,\)p2{B,\b,n)] (Befl 1980, 
[3, p. 106]) [emphasis added]. And, "It seems reasonable to ex- 
pect that if sufficiently many such causal factors can be identi- 
fied and held fixed, the residual fluctuations will be independent, 
i.e., P{M,N\a,b,X) = Pi{M\a, X)P2{N\b,X), where [. ..] A de- 
notes any number of other variables that might be relevant. (Bell 
(1981), 0, p. 152] (flrst emphasis added). However, despite all 
this, Bell did not carefully distinguish between these two notions, 
and in fact seemed to prefer the requirement of completeness. See 
footnote 1151 above. 



in region 3. Here we want to avoid this, and therefore 
will proceed to give a sound analysis of the notion of 
sufficiency involved. 

It might be true that "Once one realizes that the no- 
tion of a complete specification of beables is relative to a 
given candidate theory, there is no further problem un- 
derstanding the meaning of "complete" or "full" .... But 
it is less clear, even given some well defined candidate 
theory, what partial specifications of the beables might 
be considered "sufficient"." (Norsen, 2007, [H, p. 16]) 
and that "[i]n a more careful discussion the notion of 
completeness should perhaps be replaced by that of suf- 
ficient completeness for a certain accuracy, with certain 
epsilonics." (Bell, 1977, 0, p. 104]). But this is a prac- 
tical problem that one encounters when formulating ad- 
equate candidate theories; it is not a problem for math- 
ematically formalising the notion of local causality. The 
reason being that we may assume that a serious candidate 
theory provides us with an unambiguous specification of 
the causally relevant beables. Otherwise the question of 

whether the theory is locally causal does not even come 
Isl 



Combining the above points, a first crucial observation 
is the following. The notion of sufficiency should be rela- 
tivized with respect to a specific subclass of all beables in 
the candidate theory, namely the subclass of all beables 
in a region 3 except for the free variables and the beables 
causally related to the latter in that region. 

We thus isolate a very important aspect of the notion 
of sufficiency involved. It is 

(a): relative to a specific class R\ of beables (i.e., the 
beables have a particular space-time specification, 
and they need not include all such beables the 
theory in fact allows for!). 

The next crucial observation comes from considering the 
following question: "A is sufficient for w/iai?" The first 
obvious answer may be: sufficient for prediction of out- 
come A or B. However, this cannot be true, as the prob- 
abilities assigned to region 1 need not be determined by 
A and the settings alone. Let us illustrate this with an 
example. 

Since we are allowing genuinely stochastic theories 
there could be stochastic causal relevant factors above re- 
gion 3 and within the backward light cone of 1 . Consider 
for example some such causal factors located at events x i 
or X2 in Fig. U) It is perfectly legitimitq^ in a locally 
causal theory that these events are allowed to supplement 



^* See Norsen (2007, [151 . p. 10]) for a similar analysis that shows 
that it should be trivial to decide in a given candidate theory 
what is and what is not a causal influence. This is endorsed by 
Cartwright & Jones (1991, [13, p. 229]) who write "... but we 
take the point to be perfectly general: there are no tests of [local] 
causality outside of models which already have significant causal 
structure built in." 

^^ See Norsen (2009, Il6l p. 12]) for an extensive discussion of the 
following point: "The claim made in the local causality condition 



A in determining the probabilities of events in regions 1 
or 2 respectively, i.e., Xi influencing region 1 and X2 
influencing region 2. 

The correct answer to our question "sufficient for 
what?" is that the specification A should be sufficient for 
rendering B and b redundant for the task of specifying the 
probability of outcome A occurring. Indeed, Bell writes: 
"The hypothesis is that any such information about 2 
becomes redundant when 3 is specified completely". As 
we have just seen, 'completely' should here be replaced 
by 'sufficient' or 'sufficiently complete'. 

We thus isolate another very important aspect of the 
notion of sufficiency involved. The specification A should 
be sufficient 

(/?i): for a specific purpose, namely 

(/?2): to render some other variables redundant for the 
task of determining some particular quantity. 

In the next section the notion of sufficiency as spelled 
out via the requirements a, /3i and ^2 will be properly 
mathematically formalised. But before we can do so a 
final necessary preparatory analysis needs to be presented 
in the next subsection. 



B. The different theoretical role of 
settings and outcomes 

In the mathematical formalisation above, (i) the out- 
comes A, B, (ii) the beables A and (iii) the settings a, b 
appear as conditioning arguments in a probability dis- 
tribution. See e.g. Eq. ([1]). However, if one treats the 
settings a and b as conditioning arguments in a probabil- 
ity distribution, this implies, at least in Kolmogorovian 
probability theory, that they are random variables, and 
thus a probability distribution over their possible values 
is defined within the model: one cannot write p{x\y) un- 
less p{y) is also defined. In other words, this means that 
the candidate theory in question would have to specify 
how probable it is that Alice will choose one setting oi 
rather than 02 , and similarly for Bob and for their joint 
choises. 

But that would be a remarkable feat for any physi- 
cal theory. Even quantum mechanics leaves the question 
what measuremtn is going to be performed on a system as 
one that is decided outside the theory, and does not spec- 
ify how much more probable one measuremeny is than 
another. It thus seems reasonable not to require from 
the candidate theories that they describe such probabil- 
ities. This explains, we hope, the different theoretical 



is not that the probabilities assigned (to events in region 1, on 
the basis of complete information about region 3) are the "best 
possible" probabilities the theory allows. They aren't. Better 
ones might be assigned, e.g., if we move region 3 forward in 
time, into the more recent past of region 1." 
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status of settings a, b and random variables A, B, X. And 
lest one should despair that this distinction introduces a 
dichotomy not introduced by Bell, we note that Bell also 
attributed a special status to a, & by relegating them the 
status of 'free variables', meaning that their values could 
be set or changed at the last instant. See above, section 
lllli remark (iv). This view, we believe, should also count 
as an argument against the presumption that a putative 
candidate theory ought to provide their probabilities. 

However, above f section Hill remark (iv)) it was also 
mentioned that from a fundamental point of view both 
settings and outcomes are nothing but beables. But we 
have just indicated that they have very different roles 
in any putative candidate theory we envisage, and this 
means that we should not regard them on equal foot- 
ing, at least theoretically. Although their ontological (or 
physical) status might be the same, their theoretical sta- 
tus is not. And this is crucial. Especially since local 
causality refers to putative candidate theories only (see 
section Hill remark (i)). Thus, the difference between 
the two must be adequately reflected in any candidate 
theory. 

Therefore we adopt the following point of view. Out- 
comes A, B, as well as the beable specification A are ran- 
dom variables and figure as arguments of a joint proba- 
bility distribution Pa.b{^, B, A) = PaA"^^ B\X) p(A). The 
measurement settings a, b appear as labels of this proba- 
bility function, not as arguments. The fundamental con- 
ditional probabilities to be considered are thus 



PaMA,B\X), 



(5) 



instead of Beh's P{A,B\a,b,X). Also, Eq. (O encod- 
ing the 'free variables' assumption should in fact read: 
Pa,bW = P(A). 



of declaring x redundant for task of determining the func- 
tion fxiy), relative to the class of all y in some specific 
range Ry , then, given y the function / does not differ for 
different values of a;, i.e., 



fx{y) = fiy), Vy e Ry, Vx. 



(6) 



This can of course trivially be generalized to func- 
tions fxi,x2,..Xyijy2^ ■ ■ •) that have more than one label 
xi,X2, ■ ■ ■, and more than one argument j/i, j/2, ■ • ■• 

Recall that our preliminary analysis of sufficiency in 
section fill Al showed that such a requirement for physical 
theories implies that certain variables should be sufficient 
for a particular purpose and relative to a specific class of 
variables. See the requirements a, /3i and ^2 on page 
[7| This is retained here: y is sufficient for the purpose 
of making f^ (y) independent of x (thus not sufficient for 
determining its value, or for some other purpose), and 
this is relative to a specific class, namely to all such y 
that lie in a given range Ry. 

Applying this condition to the usual Bell-type frame- 
work we have been sketching in the previous section is 
rather easy. First of all, we will suppose the condi- 
tion ^ to hold for each of the probability distributions 
Pa.b{A,B\X), Pa,b{A\B,X), etc. Secondly, A will play the 
role of y, and thus R\ the role of Ry, and, thirdly, the 
settings a, b play the role of the labels x. 

Let us next turn to sufficiency in the case of statisti- 
cal dependence P{-\x,y), and that requires considerably 
more clarificatory effort. 



B. Sufficiency in statistical inference 



IV. INTRODUCING MATHEMATICS: 
FORMALIZING SUFFICIENCY 

Then how are we to mathematically implement Bell's 
idea of "A being sufficiently specified so as to de- 
clare redundant some of the conditional variables" in 
Pa,b{A,B\X), where the latter are in fact to range over 
both the labels a, b and the random variables A, Bl This 
we will perform next. It is important to realize that 
two distinct notions of sufficiency are in play, i.e., where 
firstly the label b (or a), and, secondly, the random vari- 
able B (or A) becomes redundant for the task of deter- 
mining the specific probability to obtain outcome A (or 
B). Each of these two notions will be clarified next. 



A. Functional sufficiency 

The first kind of sufficiency where one of the labels a, b 
becomes redundant can be rather easily formalized. Con- 
sider a function fx{y) where x labels the different func- 
tions fx{y)- When variable y is sufficient for the purpose 



The concept of sufficiency in the context of the theory of 
statistical inference was developed by R.A. Fisher (1922) 

The basic problem of statistical inference may be for- 
mulated as follows. Suppose we have some probabilistic 
experiment with a fixed set of possible outcomes x ^ X 
and a family of probability distributions pe, 9 £ Q, each 
of which provides some candidate description for the ex- 
periment. Here, Q represents some arbitrary index set. 
For each value 9 G Q, pe{x) then provides the probabil- 
ity of X to occur. It is assumed however that we do not 
know exactly what the correct probability distribution is 
for the experiment and the problem is to infer something 
about which probability distribution out of the given col- 
lection would provide a best "fit" for the experiment on 
the basis of recorded outcomes. 

It is generally useful to present the problem from a 
slightly expanded version, by adding the supposition that 
it is possible to repeat the performance of the experiment 
under i.i.d conditions (i.e., independent and identically 
distributed repeated trials). In that case, assuming one 
performs n such trials, the probability of obtaining a se- 



quence (a;i, . . . , x„) is 



Peixi 



c„) = ]^P9(a;i)- 



(7) 



The goal now becomes to make an inference about 9 on 
the basis of the outcomes (xi, . . . , x„). 

There are many forms in which such an inference could 
be casted. For example, it could take the form of a point 
estimator, i.e., by designing a function t : X" i— >■ Q such 
that t(xi, . . . ,x„) would represent the best estimate of 
9. These details need not concern us now because the 
concept of sufficiency is equally important in all such 
approaches. 

To introduce the notion of sufficiency it is useful to 
consider a set of n independent functions {yi,...,j/„} 
defined on X", such that the equations 

yi{xi,...,Xn) = Ci 



yn(a;i,- 



(8) 



always have a unique solution. Thus, one might simply 
think of (j/i, ...,?/„) as an alternative coordinate system 
that charts the points in X" just as well as (xi, . . . , a;„). 
Clearly, it is then equivalent whether one provides the 
recorded data in terms of the original values (a;i . . . , x„) 
or in terms of the alternative coordinates (yi,...,y„). 
The probability distribution can be transformed to the 
alternative coordinates: 



ox ' 
Pe{yi,---,yn) =pe{xi,...,Xn)\-Q^\ 



(9) 



where the last factor represents the Jacobian of the trans- 
formation. 

Now suppose that pe has the following form 



Peivi, .■.,yn)= Pe{yi)9{yi, • • • , ?/«)• 



(10) 



In that case, the function yi is said to be sufficient for 9. 

Of course, there are many choices for such an alterna- 
tive coordinate system. It is straightforward to show that 
if j/i out of the set {yi, . . . ,yn} is sufficient for 9, then, 
the same will hold for an alternative set {j/i, 7/2, • ■ • , yn}, 
provided that this also gives a regular coordinate system. 

The intuitive idea behind this notion of sufficiency is 
that all the information that the data provide about the 
unknown value of 9 is in this case really contained in 
2/1 alone, because the probabilities of the values of the 
remaining variables y2,---,2/n ^re insensitive to 9. In 
other words, whatever form our inference about 9 is going 
to be, it seems reasonable to make it depend only on the 
value of yi , since all the other data are irrelevant for this 
purpose. Of course, if a sufficient statistic can be found 
this greatly simplifies the problem of statistical inference 
because the number of relevant data can then be reduced 
from n to 1. 

In Fisher's own words, the criterion of sufficiency is: 



"That the statistics chosen should summarize 
the whole relevant information supplied by 
teh sample. [. . . ] In mathematical language 
we may interpret this statement by saying 
that if 9 is the parameter to be estimated, 9i 
a statistic which contains the whole of the in- 
formation as to the value of 9 which the sam- 
ple supplies, and ^2 any other statistic, then 
the surface of distribution of pairs of values 
9i and ^2 for a given value of 9 is such that 
for a given value of 9i , the distribution of ^2 
does not involve 9. In other words, when 9i 
is known, knowledge of the value of ^2 throws 
no further light upon the value of 6*." Fisher 
(1922) [H, p. 317]. 

It might be worthwhile to illustrate this by a simple 
example. Suppose we have a real- valued outcome x and a 
collection of normal probability distributions which differ 
only in their location parameter, e.g.: 



Peix) = 



1 



,-(x-9)V2 



/27r 



It is well-known that for this case, the function 



2/1 



1 " 



(11) 



(12) 



provides a sufficient statistic for 9. In general, however, 
the class of probability distributions for which a sufficient 
statistic exists is severely limited; the Pitman-Koopman 
theorem implies they exist only for the exponential fam- 

iiy- 

It useful to slightly rewrite the condition of sufficiency 
([T0| . Using ([9]) , we can rewrite it into pg{yi,y2, ■ ■ ■ ,yn) — 
Pe{yi)g{yi, ■ ■ ■ , j/n)- Next, using the definition of condi- 
tional probability this gives 



P9{y2, ■ ■ ■ ,yn\yi) = 3(2/1,..., ?/„), 



(13) 



which states that if j/i is sufficient for 9 then the condi- 
tional probability pg{y2, ■ . ■ , yn\yi) must be independent 
of 9 (because 5(2/1, . . . , 2/n) is), and thus 



Pe(2/2 



,2/«|yi) =p(2/2,---,2/«l2/i)- 



(14) 



This alternative formulation of sufficiency shows that 
once 2/1 is given, the rest of the data (i.e., {2/2, ■ • • , 2/n}) 
is irrelevant to 9. 

For our purposes the following notes are of crucial im- 
portance. Note firstly that notions of locality or causa- 
tion can be kept safely on the bench in this approach. 
For example, it might be that 9 labels various races of 
tomato plants, and x the weight of a tomato produced 
by a such a plant. 

Secondly, Fisher's talk about 'information' should be 
understood in the following sense. It refers to informa- 
tion that is 'contained' in a sample and that is 'about' 
something. To say that 2/1 is sufficient is qualified by say- 
ing that it is sufficient for a purpose (inferring the value 
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of 9, it may well be insufBcient for other purposes!) and 
relative to a class Ry, namely all other statistics of the 
same sample space, i.e., all other functions of 2/2, ■ • ■ , ?/« 
of the outcome space X", i.e., Ry = {j/i, . . . , j/„}. 

So we see that this framework naturally incorporates 
the two aspects mentioned in subsection IIII A[ namely 
that sufficiency of A should be characterised as (i) for a 
specific purpose, the purpose beign to render some other 
variable redundant and (ii) relevant to a specific class of 
variables. See the requirements a, /3i and /32 on page[7l 

Finally, some words about its status. In mathemati- 
cal statistics, one might feel, sufficiency is actually just 
a name for a particular definition. As such, one may 
ask, how can this be of any help to foundational or con- 
ceptual problems in physics? However, even in statistics 
sufficiency is much more than just a definition. It is a 
desideratum. But it can be turned into something much 
more powerful by formulating the demand that, for cer- 
tain physical probabilistic or stochastic theories, certain 
variables should be sufficient (in the sense given by the 
definition above) for a particular purpose relative a spe- 
cific class of variables. To this we will now turn. 



1. Bayesian inference 

The above approach used the orthodox formulation of 
statistical inference, in which parameters are kept strictly 
distinct from outcomes. The basic reason for this division 
is that while a statistical model provides probabilities for 
the events or outcomes x, there is usually not a corre- 
sponding probability for parameters, i.e., they are usually 
not random variables but labels for the probability dis- 
tributions indicating settings of measurement apparata. 

At first sight this dichotomy between parameters and 
outcomes in orthodox statistical inference corresponds 
nicely to the same dichotomy adopted above in section 
IIII B I between the theoretical roles played by the settings 
a, h and outcomes A, Bl However, in the context of Bell's 
notion of local causality we are not aiming at making an 
inference about the settings a or h. Therefore we will 
have to change the perspective somewhat. 

There is an alternative approach to statistical infer- 
ence, the Baysian approach, that does not rely on such a 
strict division between events and parameters. 

Bayesian statistical inference proceeds from a similar 
point of view except that now one assumes the existence 
of a so-called prior probability distribution over the pa- 
rameter 0. Furthermore, the probability distributions 
Pe{x) are now reinterpreted as conditional distributions 



Pe{x) ^p(,x\e) . 



(15) 



Given these two assumptions, it is possible to provide a 
so-called posterior probability distribution by means of 
Bayes' theorem, i.e.. 



p{9\x) = 



P{x\0)p{0) 

Jdepix\e)pie) 



(16) 



Extending this to the case of multiple, independent 
and identically distributed trials one obtains: 



p{0\xi,...Xn) = 



UtlPe{^^)p{0) 

Im=lPeix^)pme 



(17) 



In the Bayesian approach, the goal of statistical infer- 
ence is to report this posterior probability distribution. 
Note that this viewpoint does not necessarily presuppose 
a subjective interpretation of probability. 

As stated before, sufficiency is an important notion re- 
gardless of which precise approach to statistical inference 
is chosen. In the Baysian approach this takes the follow- 
ing form. Firstly, note that the transformation ^ re- 
mains valid. Secondly, consider p{9\yi, . . . , z/n) and note 
that using Bayes Theorem this is equal to 



p{y,,...,yn\0)p{0) 
Jpiyr,...,yn\e)p{e)de 

Then assuming yi to be sufficient for 
^^ obtains, we deduce that 



(18) 



I.e., assummg 



p{0\yi,...,yn) = 



pe{yi)giyi,---,yn)pi0) 



Ipe{yi)g{yi,-- 

pe{yi)p{d) 
J Peiyi)pi9)d9 



,yn)pi0)d0 



(19) 



Thus once the fundamental assumptions of Bayesian in- 
ference are in place, the sufficiency condition is entirely 
equivalent to 



p{0\yi,...yn) =p{0\yi) 



(20) 



Perhaps, this makes the underlying motivation of suffi- 
ciency even clearer: if the above condition holds, then the 
probability of 0, once j/i is given, is not changed when 
the values of y2 , • ■ • , 2/n are included. These additional 
functions of the data are irrelevant or redundant for the 
purpose of assigning the posterior probability. In accor- 
dance with the above terminology, this is expressed as: 
j/i is sufhcient for the purpose of rendering some set of 
other variables redundant relative to a class of variables 



Ry 



{yi,...,y„}. 



It should also now be clear how this condition would 
fit in with the usual Bell-type framework we have been 
sketching in the section [H] (e.g., see Fig. [3]). First of all, 
we will suppose the condition (|20p to hold for each of 
the probability distributions labeled by the settings a, b. 
Secondly, A will play the role of yi, and each of the yt 
(i y^ 1) should be one of the other random variables in 
the candidate theory in question, for example one of the 
settings A, B, or some other beable specification A'. The 
role Ry is thus taken over by R\ . Thirdly, is one of the 
outcomes A, B. 

Now, although obviously A is not a datum nor a func- 
tion of the data, it is still assumed to be a random vari- 
able. It has a value, and although it will in general be 
hidden for us we can reason about the hypothetical case 
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that we would know its value. Perhaps one might won- 
der if the identification 9 with the outcomes A,B is a 
valid move to make. After all, we have introduced the 
variables yi , . . . , y„ as functions on the space of outcomes 
X", whereas labelled probability distributions. But in 
a Baysian approach this distinction does not count as 
fundamental anymore. As is clear from ()20p we are ba- 
sically dealing with probability distributions on 8 x X", 
and both and yi can be seen as functions on this larger 
space. 

By now we have gathered enough mathematically clean 
and precise tools to explicate Bells' intuitive notion of 
local causality, and this will be carefully outlined in the 
next section. 



V. THE BABY: BELL'S LOCAL CAUSALITY 
'MATHEMATICALLY SHARP AND CLEAN' 



section this entails that functional sufficiency ^ renders 
the label h and statistical sufficiency (PH)) the random 
variable B redundant for prediction of Pa,&(A|_B, A). For 
determining the probability of obtaining outcome B a 
completely analogous analysis obtains so that we finally 
get the mathematically sharp and clean formulation of 
the condition of local causality: 



Pa,M\B,X)^Pa{A\\), 
Pa,b{B\A,X)^PbiB\\), 

from which one trivially obtains factorisability: 

Pa,b{A,B\X) ^ Pa{A\X) Pb{B\X). 



(21) 



(22) 



Although rather similar to ([TJ , which in section [II] was 
claimed to be the mathematical expression of the condi- 
tion local causality, the differences and alternative deriva- 
tion are crucial. 



Recall that our analysis of sufficiency showed that such 
a requirement for physical theories implies that certain 
variables should be sufficient (in one of the two senses 
given above) for a particular purpose relative to a spe- 
cific class of variables. We have argued that then © 
and (j20p can be obtained as mathematical criteria when 
dealing with, respectively, labels attached to functions 
and random variables that are to be conditioned on. 

In Bell's conditionn of local causality it is the beable 
specification A that is supposed to be sufficient. Thus in 
order to analyse this condition as a sufficiency criterion, 
we must, firstly, indicate the purpose for which A is suffi- 
cient (per requirement /3i and P2) and, secondly, indicate 
relative to which beable class this is so, i.e., to indicate 
the range Rx of allowable A (per requirement a) . 

We believe that by now it should come as no surprise 
that our proposal is the following: Consider again Fig. 
[21 A candidate theory will be said to be locally causal 
when, if region 1 is space-like separated from region 2, 
the theory provides a specification A in a region 3 that 
shields off region 2 from the overlap of the back-ward 
light cones of 1 and 2, that is sufficient, in the sense of 
(|ni) and (PO)) . for the purpose of rendering the beables in 
the far-away region 2 redundant for the task of predicting 
the probabilities of events in region 1, where A is relative 
to the class of all allowable beable specifications that can 
be given about a region 3 (according to the candidate 
theory and consistent with the 'free variables' assump- 
tion), i.e., R\ contains all allowable beable specifications 
in this region 3. 

If we now apply this to the standard bi-partite setup of 
Fig. 13] we obtain that if the candidate theory in question 
obeys local causality then the theory provides a beables 
specification A in region 3a that is sufficient for the pur- 
pose of rendering the far-away outcome B and setting b 
redundant for the task of determining the probabilities of 
obtaining A, and this is so relative to all other allowable 
beable specifications Rx the candidate theory in question 
provides for region 3a. Using the results of the previous 



VI. NOT THROWING OUT THE BABY WITH 
THE BATHWATER 

A. Remarks 

(one) All this might look like an overcomplicated way to 
obtain the already well-known, i.e., something similar to 
([T]), but we have in fact obtained quite a lot: a mathemat- 
ically clean formulation (l^lT) that brings to the forefront 
crucial aspects otherwise left out. The formalisation, 
firstly, encodes the particular notions of sufficiency and 
redundancy that are involved, secondly, incorporates the 
theoretical distinction between outcomes (random vari- 
ables) and settings (labels) enforced by the 'free variables' 
assumption, and finally, indicates rigourously where the 
constraints set by the notions of locality and causality 
enter the mathematical formalisation. 

This latter point has not been stressed before and 
needs some elaboration. Note that the mathematical 
formalisation of sufficiency itself needs no requirement 
what so ever of locality or causality, as can be deduced 
from the formal analysis of section IIVI We must thus 
look elsewhere, and here is our proposal. When applying 
both statistical and functional sufficiency to the setup of 
Fig. [3l requirements of locality and causation necessarily 
come in play when fullfilling the requirements (a) and 
(/3i) and (^2) of section UlI Al pageH 

Firstly, it is by an appeal to the principle that causality 
can only be local in the sense of Fig. [U that the purpose 
for which A is sufficient is specified. Indeed, because lo- 
cal causality stipulates that causes operating in a certain 
region in space-time must lie in the backward light cone 
of that region and effects of anything occuring in that re- 
gion can only lie in its forward light cone, we get the in- 
ference that anything outside the backward and forward 
light cone of that particular space-time region should be 
causally redundant. See Fig. [T]and the very first quote 
by Bell in section |lTl But in order to distinguish mere 
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correlation from causal influence, it is, secondly, that the 
range R\ of allowable beable specifications is restricted 
by considerations regarding locality and causality. The 
beables in R\ must lie in a spacetime region with specific 
characteristics, namely it must shield off the overlap of 
the light cones of region 1 and 2 from these same regions 
1 and 2. The possibility of spurious violations of local 
causality mentioned in remark (iii) on page 2] is in this 
way eliminated. 

In conclusion, local causality is not a mere statisti- 
cal no-correlations requirement; to the contrary, it has a 
substantial relation to the relativistic causal structure of 
Fig. [21 See also Norsen (2007, [l^) who extensively ar- 
gues for this point. According to our analysis presented 
here. Bell's condition of local causality is a special form of 
statistical and functional sufficiency, where the purpose 
for which A is claimed to be sufficient, and the range of 
validity (the class R\ of beable specifications to which it 
is relative) are motivated by locality and causality con- 
straints. 

It is important to realize that the notions of locality 
and causality referred to here involve nothing but the 
special relativistic causal structure as exemplified in the 
light cone structure of Fig. [1] Importantly, there is 
no need for some philosophical theory of causation or 
an appeal to the problematic notion of Reichenbach's 
Principle of the Common Cause, or the like (see also 
Norsen (2007, 115])). 

(two) It is only A that is supposed to be sufficient, and 
not A plus one or both of the settings. Of course the 
settings a and b are needed to determine the outcomes 
A and B respectively, but, remember, prediction of 
outcomes of measurement is not the purpose for which 
A is supposed to be sufficient! 

As was mentioned in footnote [51 Norsen (2007 [15J . 
2009 [16]) does include the local setting in the specifi- 
cation of the beables that are supposed to render some 
other beables in a space-like separated region redundant. 
However, this is not needed, and in fact even unwantectj. 
For, after all, what should be in R\? Only those beables 
whose causal past could be correlated, in the sense of 
Fig- [Band according to the candidate theory in question, 
to the causal past of the beables that are to be rendered 
redundant, i.e., the beables in region 2, such as B, b. Be- 
ing free variables the local settings should thus not be in 
Rx, as the following example shows. 

If including a in the beable specification of region 3 
could be relevant to render 6, B in region 2 redundant 
for prediction of the probability to obtain A, then 
there needs to be a genuine possibility in the candidate 
theory under study for correlations between the causal 
past of a and the events in region 2 (see Fig. [2] and 
Fig. [3]). However, this is excluded by the 'free variables' 



assumption. We can think of a being chosen at the 
very last instant, and therefore (in any locally causal 
theory) a can make no difference as to whether beables 
in space-like separated region 2 are, or are not redun- 
dant for prediction of obtaining outcome A in region 
1. Furthermore, from a mathematical point of view, 
including a in Rx makes no difference and is therefore 
better left out. 

(three) One could proceed in a two step proce- 
dure to obtain the mathematical formulation of local 
causality ([^T|) and the condition of factorisability (|22p . 
by firstly requiring statistical sufficiency ^ so as to 
obtain. 



Pa,b{A\B,X)=Pa.b{A\X), 
Pa,b{B\A, X) = PaM{B\X) , 



(23) 



and only then functional sufficiency (|20)) to subsequently 
get: 



PaAA\X) ^ PaiA\X) , 

PaMB\\) ^ PbiB\X) . 



(24) 



^^ Also, it should be mentioned that Bell himself explicitly refrains 
from doing this. See Fig. (6.6) of Bell (1990 [1, p. 108]). 



These two requirements together indeed imply (j2ip and 

It might be tempting to think of each of these two 
conditions (j23p and (|24)) as implementing a different 
weaker assumption than local causality itself. For after 
all it is the conjunction of the two that gives the desired 
condition (J21D . Logically this is indeed true. But despite 
this theoretical difference, the physical status of the two 
conditions is exactly the same. Both are a consequence 
of local causality, and the appeal to notions of locality 
and causality used in implementing the functional and 
statistical sufficiency are just the same, see remark 
(one) above. Elsewhere this will be further argued for 
(Seevinck & Uffink, 2010, [1^) and the comparison to 
a similar famous two-step procedure by Jarrett (1984, 
[III) and Shimony (1984, [ig) wih be there presented. 

(four) Orthodox quantum mechanics violates (j22l) . 
Indeed, as is well-known, the quantum mechanical 
predictions using the singlet state can be easily used to 
provide such a violation. A closer look reveals that the 
theory violates statistical sufficiency because it violates 
([23| , but it obeys (|24)) and thereby functional sufficiency. 

From this we can conclude that quantum mechanics 
does not provide a beable specification A in region 3 with 
the correct characteristics, i.e., the theory is unable to 
provide a specification of beables in any appropriate re- 
gion 3 such that the outcome B always becomes redun- 
dant for the probability of determining outcome A. 

It is tempting to draw more grand conclusions than 
this one, say, of a somewhat foundational, philosophical 
or meta-physical nature. We will refrain from doing 
that here, but in remark (six) below we will address the 
controversial question of what legitimate conclusions can 
be drawn from violations of local causality. 
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(five) The qualification of tlie class of beables R\ 
is essential because we must carefully ensure that no 
spurious violations of local causality are allowed for, 
and at the same time that we do not encode too much 
in the specification A so as to come into conflict with 
the 'free variables' assumption. Therefore, as was 
shown earlier, the specification cannot be taken to be 
complete, as some free variables must probably be left 
out, yet it must be allowed to range over all other 
beable specifications the candidate theory in question 
allows for. This justifies calling the specification of A 
'sufficiently complete', meaning that nothing causally 
relevant that is not 'free' from the candidate theory 
under consideration is left out, without claiming the 
specification to be complete in the sense that everything 
in the theory is included, including the free variables. 
This ensures that the spurious violations of remark (iii) 
and (iv) can not occur, and that local causality is not 
just a mere no-correlations condition. 



(six) Suppose (j2ip fails, how are we to interpret 
this? Well, given our remarks above, there seems to be 
only one option, namely that local causality is violated, 
i.e., there must be some non-local causation present in 
the candidate theory under study. 

We cannot blaim a violation of ([?!]) on the specification 
A not being sufficiently well-specified, for as was just ar- 
gued in remark (five) above, Rx ranges over all allowable 
beable descriptions and should be regarded 'sufficiently 
complete'. 

Nor can we blaim such a violation on the existence 
of 'locally explicable' correlations. The space-time struc- 
ture of Fig. [3]and the further specification of local causal- 
ity via the notion of sufficiency, including the require- 
ments (a), (/3i) and (/32), excludes any spurious viola- 
tions of ([HI) due to correlations that do allow a locally 
causal explanation. This we argued for in section Hill 

The question then arises what it (philosophically, 
meta-physically) means for local causality to be violated. 
Such an investigation will be performed elsewhere [ig . 



As a final remark we wish to present a point made by 
Norsen (2009, 0, p. 12] £3 concerning a violation of (^11) : 
"It isn't necessarily that something in region 2 is causally 
influencing something in region 1, or vice versa. It is 
always possible that there is some other event, neither 
in region 1 nor region 2, which was not determined by 
[A] , and which itself causally influences both [beables in 
region 1] and [in region 2]. The point is, though, that this 
causal influence would have to be non-local (i.e., would 
have to violate the special relativistic causal structure 
sketched in [Fig. [2]." 

VII. ENVOI 

Have we thrown out the baby with the bathwater? We 
believe not, as our cleaning up of the intuitive idea of 
local causality for mathematics has proven to be fruitful 
and to clarify hitherto unknown aspects of the notion 
of local causality. It should be seen as a complement 
to Norsen's manuscripts (2007 [11], 2009 ^) taking his 
analysis a step further. 

Also, we believe that our mathematical formalisation 
of Bell's notion of local causality further unearths its rich 
conceptual background, and that it thereby brings us a 
bit closer to answering the hard and open foundational 
questions that arise from attempts to incorporate viola- 
tions of local causality into our physical worldview. 

What critical light this novel mathematical formalisa- 
tion throws on other interpretation and motivations of 
Bell's notion of local causality is still to be worked out. 
This we hope to unearth in the near future (Seevinck & 
Uffink, 2010, (3). 



ACKNOWLED GMENTS 

MPS acknowledges very fruitful conversations with Hans 
Westman and Eric Cavalcanti and thanks the Centre for 
Time, Sydney, Australia for hosting him as a guest re- 
searcher. JU acknowledges fruitful discussions with Joe 
Henson and thanks the Perimeter Institute, Waterloo, 
Canada for generous hospitality. 



[1] Bell, J.S. (1964). On the Einstein-Podolski-Rosen Para- 
dox, Physics 1, 195. Reprinted in [g|, chapter 2. 

[2] Bell, J.S. (1976). The theory of local beables, Epistemo- 
logical Letters, vol. 9, March 1976. Reprinted in Dialec- 
tica 39, 85 (1985) and in ;8], chapter 7. 

[3] Bell, J.S. (1977). Free variables and local causality, Epis- 
temological letters, February 1977. Reprinted in [3|, 
chapter 12. 

[4] Bell, J.S. (1980). Atomic-cascade photons and quantum- 
mechanical nonlocality. Comments on Atomic and Molec- 
ular Physics 9, 121. Reprinted in j^, chapter 13. 



^^ See footnote |9] for the reason of mentioning it here. 



[5] Bell, J.S. (1981). Bertlmann's socks and the nature of 
reality. Journal de Physique, CoUoque C2 , suppl. au 
numero 3, Tome 42, 41. Reprinted in 8j, chapter 16. 

[6] Bell, J.S. (1984). Beables for quantum field theory, 
CERN-TH.4035/84. Reprinted in [8], chapter 19. 

[7] Bell, J.S. (1986). EPR correlations and EPW distribu- 
tions. In New Techniques and Ideas m Quantum Measure- 
ment Theory, New York Academy of Sciences. Reprinted 
in 8], chapter 21. 

[8] Bell, J.S. (1987). Speakable and unspeakable in quantum 
mechanics. Cambridge: Cambridge University Press. 

[9] Bell, J.S. (1990). La nouvelle cuisine. In A. Sarlemijn and 
P. Kroes (Eds.), Between Science and Technology (pp. 
97-115). Elsevier (North-Holland). 



14 



[10] Cartwright, N., Jones, M. (1991). How to hunt quantum 
causes. Erkenntnis, 35, 205-231. 

[11] Cavalcanti, E.G. (2008). Reality, locality and all that: 
"experimental metaphysics" and the quantum founda- 
tions. PhD thesis, the University of Queensland. arXiv: 
0810.4974 (vl, 28 Oct 2008) 

[12] Elby, A., Brown, H.R., Foster, S. (1993). What makes a 
Theory Physically "Complete"?, Found. Phys. 23, 971. 

[13] Fisher, R.A. (1922). On the mathematical foundations of 
theoretical statistics. Philosophical Transactions of the 
Royal Society, A, 222. p. 309368. 

[14] Jarrett, J. P. (1984). On the Physical Significance of the 



Locality Conditions in the Bell Arguments, Noiis 18, 569. 

[15] Norsen, T. (2007). J.S. Bell's Concept of Local Causality, 
larXiv:0707.0 401 (v2, 6 Jan 2010) 

[16] Norsen, T. (2009). Local Causality and Completeness: 
Bell vs. Jarrett, Found. Phys. 39, 273. 

[17] Seevinck, M.P. (2008). Parts and Wholes, PhD thesis. 
University of Utrecht. arXiv:0811.1027 (v3, 23 Apr 2009) 

[18] Seevinck, M.P., Ufiink, J. (2010). In preparation. 

[19] Shimony, A. in Proceedings of the International Sympo- 
sium: Foundations of Quantum Mechanics in the light of 
New Technology, (Tokyo, 1984). 



